An Error-Oriented Approach to Word Embedding Pre-Training
نویسندگان
چکیده
We propose a novel word embedding pretraining approach that exploits writing errors in learners’ scripts. We compare our method to previous models that tune the embeddings based on script scores and the discrimination between correct and corrupt word contexts in addition to the generic commonly-used embeddings pre-trained on large corpora. The comparison is achieved by using the aforementioned models to bootstrap a neural network that learns to predict a holistic score for scripts. Furthermore, we investigate augmenting our model with error corrections and monitor the impact on performance. Our results show that our error-oriented approach outperforms other comparable ones which is further demonstrated when training on more data. Additionally, extending the model with corrections provides further performance gains when data sparsity is an issue.
منابع مشابه
Efficient learning for spoken language understanding tasks with word embedding based pre-training
Spoken language understanding (SLU) tasks such as goal estimation and intention identification from user’s commands are essential components in spoken dialog systems. In recent years, neural network approaches have shown great success in various SLU tasks. However, one major difficulty of SLU is that the annotation of collected data can be expensive. Often this results in insufficient data bein...
متن کاملThe effectiveness of skills training based on emotion-oriented approach on anxiety sensitivity and emotion control of women affected by extramarital relationships
The purpose of this research was to determine the effectiveness of teaching skills based on an emotion-oriented approach to anxiety sensitivity and controlling the emotions of women affected by extramarital relationships. This research was a semi-experimental type with a pre-test, post-test, and control group with a follow-up phase for two months. The statistical population included all women a...
متن کاملA New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملEffectiveness of Mindfulness Oriented Recovery Enhancement Approach on Attentional Bias and Disability in Chronic Pain Patients
Aims and background: Selective attention to pain-related stimuli, known as pain attentional bias (AB) can exacerbate pain, disability and undermine quality of life. The aim of this study was to determine effectiveness of mindfulness oriented recovery enhancement approach on attentional bias related to pain and disability among Chronic Pain Patients. Materials and methods: The present study was...
متن کاملPhishing website detection using weighted feature line embedding
The aim of phishing is tracing the users' s private information without their permission by designing a new website which mimics the trusted website. The specialists of information technology do not agree on a unique definition for the discriminative features that characterizes the phishing websites. Therefore, the number of reliable training samples in phishing detection problems is limited. M...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017